AITopics | word-level discriminator

Collaborating Authors

word-level discriminator

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Controllable Text-to-Image Generation

Bowen Li, Xiaojuan Qi, Thomas Lukasiewicz, Philip Torr

Neural Information Processing SystemsFeb-11-2026, 15:36:35 GMT

Also, a word-level discriminator is proposed to providefine-grained supervisory feedback bycorrelating wordswithimageregions, facilitating training an effective generator which is able to manipulate specific visual attributes without affecting the generation of other content. Furthermore, perceptual loss is adopted to reduce the randomness involved in the image generation, andtoencourage thegenerator tomanipulate specific attributesrequired inthemodified text.

artificial intelligence, arxivpreprintarxiv, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.91)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Add feedback

1d72310edc006dadf2190caad5802983-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-11-2026, 15:36:20 GMT

channel-wise attention, effectiveness, word-level discriminator, (16 more...)

Neural Information Processing Systems

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (0.32)
Information Technology > Artificial Intelligence > Vision (0.31)

Add feedback

LightweightGenerativeAdversarialNetworksfor Text-GuidedImageManipulation

Neural Information Processing SystemsFeb-11-2026, 05:11:48 GMT

Methods introduced in [4, 19] both fail toeffectively modify text-required attributes, and results are also farfrom satisfactory (see Figure 1).

artificial intelligence, discriminator, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

Controllable Text-to-Image Generation

Bowen Li, Xiaojuan Qi, Thomas Lukasiewicz, Philip Torr

Neural Information Processing SystemsOct-9-2025, 13:23:53 GMT

When the given text description (e.g., colour) is changed,

artificial intelligence, discriminator, machine learning, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

1d72310edc006dadf2190caad5802983-AuthorFeedback.pdf

Neural Information Processing SystemsOct-2-2025, 07:43:18 GMT

artificial intelligence, channel-wise attention, word-level discriminator, (16 more...)

Neural Information Processing Systems

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (0.32)
Information Technology > Artificial Intelligence > Vision (0.31)

Add feedback

Lightweight Generative Adversarial Networks for T ext-Guided Image Manipulation Bowen Li

Neural Information Processing SystemsAug-17-2025, 09:16:49 GMT

Furthermore, thanks to the explicit training signal related to each word, the discriminator can also be simplified to have a lightweight structure.

artificial intelligence, discriminator, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
North America > Canada (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > China > Hong Kong (0.04)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Reviews: Controllable Text-to-Image Generation

Neural Information Processing SystemsJan-22-2025, 03:35:11 GMT

The paper is well-organized and written, which can be followed easily. In particular, instead of generating a new image from the text, the authors pay more attention to image manipulation based on the modified natural language description. For the word-level spatial and channel-wise attention driven generator: (1) The novelty and effectiveness of attentional generator may be limited. Specifically, the paper designs a word-level spatial and channel-wise attention driven generator, which has two attention parts (i.e. However, since the spatial attention is based on the method in AttnGAN [7], most contributions may lie on the additional channel-wise part.

channel-wise attention, controllable text-to-image generation, mechanism, (12 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Vision (0.85)
Information Technology > Sensing and Signal Processing > Image Processing (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)

Add feedback

Illegible Text to Readable Text: An Image-to-Image Transformation using Conditional Sliced Wasserstein Adversarial Networks

Karimi, Mostafa, Veni, Gopalkrishna, Yu, Yen-Yun

arXiv.org Machine LearningOct-11-2019

Automatic text recognition from ancient handwritten record images is an important problem in the genealogy domain. However, critical challenges such as varying noise conditions, vanishing texts, and variations in handwriting make the recognition task difficult. We tackle this problem by developing a handwritten-to-machine-print conditional Generative Adversarial network (HW2MP-GAN) model that formulates handwritten recognition as a text-Image-to-text-Image translation problem where a given image, typically in an illegible form, is converted into another image, close to its machine-print form. The proposed model consists of three-components including a generator, and word-level and character-level discriminators. The model incorporates Sliced Wasserstein distance (SWD) and U-Net architectures in HW2MP-GAN for better quality image-to-image transformation. Our experiments reveal that HW2MP-GAN outperforms state-of-the-art baseline cGAN models by almost 30 in Frechet Handwritten Distance (FHD), 0.6 on average Levenshtein distance and 39% in word accuracy for image-to-image translation on IAM database. Further, HW2MP-GAN improves handwritten recognition word accuracy by 1.3% compared to baseline handwritten recognition models on the IAM database.

discriminator, handwriting recognition, recognition, (14 more...)

arXiv.org Machine Learning

1910.05425

Country:

North America > United States > Texas > Brazos County > College Station (0.04)
North America > Canada > Alberta > Census Division No. 6 > Calgary Metropolitan Region > Calgary (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.89)

Add feedback